Sturgeon County
Introducing the A2AJ's Canadian Legal Data: An open-source alternative to CanLII for the era of computational law
The Access to Algorithmic Justice project (A2AJ) is an open-source alternative to the Canadian Legal Information Institute (CanLII). At a moment when technology promises to enable new ways of working with law, CanLII is becoming an impediment to the free access of law and access to justice movements because it restricts bulk and programmatic access to Canadian legal data. This means that Canada is staring down a digital divide: well-resourced actors have the best new technological tools and, because CanLII has disclaimed leadership, the public only gets second-rate tools. This article puts CanLII in its larger historical context and shows how long and deep efforts to democratize access to Canadian legal data are, and how often they are thwarted by private industry. We introduce the A2AJ's Canadian Legal Data project, which provides open access to over 116,000 court decisions and 5,000 statutes through multiple channels including APIs, machine learning datasets, and AI integration protocols. Through concrete examples, we demonstrate how open legal data enables courts to conduct evidence-based assessments and allows developers to create tools for practitioners serving low-income communities.
Leveraging Large Language Models for Relevance Judgments in Legal Case Retrieval
Ma, Shengjie, Chen, Chong, Chu, Qi, Mao, Jiaxin
Collecting relevant judgments for legal case retrieval is a challenging and time-consuming task. Accurately judging the relevance between two legal cases requires a considerable effort to read the lengthy text and a high level of domain expertise to extract Legal Facts and make juridical judgments. With the advent of advanced large language models, some recent studies have suggested that it is promising to use LLMs for relevance judgment. Nonetheless, the method of employing a general large language model for reliable relevance judgments in legal case retrieval is yet to be thoroughly explored. To fill this research gap, we devise a novel few-shot workflow tailored to the relevant judgment of legal cases. The proposed workflow breaks down the annotation process into a series of stages, imitating the process employed by human annotators and enabling a flexible integration of expert reasoning to enhance the accuracy of relevance judgments. By comparing the relevance judgments of LLMs and human experts, we empirically show that we can obtain reliable relevance judgments with the proposed workflow. Furthermore, we demonstrate the capacity to augment existing legal case retrieval models through the synthesis of data generated by the large language model.
ChatGPT Alternative Solutions: Large Language Models Survey
Alipour, Hanieh, Pendar, Nick, Roy, Kohinoor
In recent times, the grandeur of Large Language Models (LLMs) has not only shone in the realm of natural language processing but has also cast its brilliance across a vast array of applications. This remarkable display of LLM capabilities has ignited a surge in research contributions within this domain, spanning a diverse spectrum of topics. These contributions encompass advancements in neural network architecture, context length enhancements, model alignment, training datasets, benchmarking, efficiency improvements, and more. Recent years have witnessed a dynamic synergy between academia and industry, propelling the field of LLM research to new heights. A notable milestone in this journey is the introduction of ChatGPT, a powerful AI chatbot grounded in LLMs, which has garnered widespread societal attention. The evolving technology of LLMs has begun to reshape the landscape of the entire AI community, promising a revolutionary shift in the way we create and employ AI algorithms. Given this swift-paced technical evolution, our survey embarks on a journey to encapsulate the recent strides made in the world of LLMs. Through an exploration of the background, key discoveries, and prevailing methodologies, we offer an up-to-the-minute review of the literature. By examining multiple LLM models, our paper not only presents a comprehensive overview but also charts a course that identifies existing challenges and points toward potential future research trajectories.
Towards Maximizing the Representation Gap between In-Domain \& Out-of-Distribution Examples
Nandy, Jay, Hsu, Wynne, Lee, Mong Li
Among existing uncertainty estimation approaches, Dirichlet Prior Network (DPN) distinctly models different predictive uncertainty types. However, for in-domain examples with high data uncertainties among multiple classes, even a DPN model often produces indistinguishable representations from the out-of-distribution (OOD) examples, compromising their OOD detection performance. We address this shortcoming by proposing a novel loss function for DPN to maximize the \textit{representation gap} between in-domain and OOD examples. Experimental results demonstrate that our proposed approach consistently improves OOD detection performance.
Should Artificial Intelligence Be Regulated? Issues in Science and Technology
Rapid advances in computing and robotics have led to calls for government controls. Before acting, we need to distinguish among the many meanings and applications of the technology. New technologies often spur public anxiety, but the intensity of concern about the implications of advances in artificial intelligence (AI) is particularly noteworthy. Several respected scholars and technology leaders warn that AI is on the path to turning robots into a master class that will subjugate humanity, if not destroy it. Others fear that AI is enabling governments to mass produce autonomous weapons--"killing machines"--that will choose their own targets, including innocent civilians. Renowned economists point out that AI, unlike previous technologies, is destroying many more jobs than it creates, leading to major economic disruptions. There seems to be widespread agreement that AI growth is accelerating.
LegalAIIA Workshop To Explore Artificial Intelligence and Intelligent Assistance H5
The First International Workshop on AI and Intelligent Assistance for Legal Professionals in the Digital Workplace (LegalAIIA) will be held at the Cyberjustice Laboratory at the University of Montreal on June 17th. This workshop is part of the 17th International Conference on AI and Law (ICAIL), a biennial conference which has served as an important forum at the intersection the AI and the law since its founding in 1987. The LegalAIIA workshop itself is an offshoot of the successful decade-long DESI (Discovery for Electronically Stored Informed) workshop series, which was pivotal in helping forge an interdisciplinary community of legal and technical practitioners working on advancing the state-of-the-art in electronic discovery practice. The first edition of Legal AIIA, driven by an impressive set of electronic discovery veterans including Jack G. Conrad (Thomson Reuters), Jeremy Pickens (Catalyst Repository Systems), Amanda Jones (H5), Hans Henseler (Magnet Forensics), and Jason R. Baron (Drinker, Biddle & Reath), aims to tackle head on the issue of human-AI collaboration. Accepted papers will focus on evaluating when and how to best leverage a "human-in-the-loop" approach to AI.
Predictive Uncertainty Estimation via Prior Networks
Estimating how uncertain an AI system is in its predictions is important to improve the safety of such systems. Uncertainty in predictive can result from uncertainty in model parameters, irreducible \emph{data uncertainty} and uncertainty due to distributional mismatch between the test and training data distributions. Different actions might be taken depending on the source of the uncertainty so it is important to be able to distinguish between them. Recently, baseline tasks and metrics have been defined and several practical methods to estimate uncertainty developed. These methods, however, attempt to model uncertainty due to distributional mismatch either implicitly through \emph{model uncertainty} or as \emph{data uncertainty}. This work proposes a new framework for modeling predictive uncertainty called Prior Networks (PNs) which explicitly models \emph{distributional uncertainty}. PNs do this by parameterizing a prior distribution over predictive distributions. This work focuses on uncertainty for classification and evaluates PNs on the tasks of identifying out-of-distribution (OOD) samples and detecting misclassification on the MNIST and CIFAR-10 datasets, where they are found to outperform previous methods. Experiments on synthetic and MNIST and CIFAR-10 data show that unlike previous non-Bayesian methods PNs are able to distinguish between data and distributional uncertainty.
AI Application in Insurance Industry - The Fintech Times
Many insurers are investing in AI beyond MLwhich is one of its subfield. Opportunities range from an enhanced Customer Experience (reduced cycle time, personalized advisors through chatbots, fast track Claims management), to productivity efficiency, pricing sophistication, churn risk anticipation and accurate Fraud detection patterns. Insurers can either build internal capabilities, partner with start-ups on these fields or do both to accelerate time to market impacts. AI is a great enabler. Nevertheless, the right balance between human contacts and AI is key.